Classification of molecular sequence data using Bayesian phylogenetic mixture models

نویسندگان

  • E. Loza-Reyes
  • M. A. Hurn
  • A. Robinson
چکیده

Rate variation among the sites of a molecular sequence is commonly found in applications of phylogenetic inference. Several approaches exist to account for this feature but they do not usually enable us to pinpoint the sites that evolve under one or another rate of evolution in a straightforward manner. In this paper we concentrate on phylogenetic mixture models as tools for site classification. Our method does not rely on prior knowledge of site membership to classes or even the number of classes. Furthermore, it does not require correlated sites to be next to one another in the sequence alignment, unlike some phylogenetic hidden Markov or change-point models. We present a simulation study to show that our approach is able to correctly classify the sites to evolutionary classes and we analyse the popular alignment of the mitochondrial DNA of primates. In both examples, all mixtures outperform commonly-used models of among-site rate variation and models that do not account for rate heterogeneity. Our method for site classification is directly relevant to the profiling of genes with unknown function, and its application may lead to the discovery of partitions not otherwise recognised in the alignment. In addition, we discuss computational aspects including Department of Biomathematics and Bioinformatics, Rothamsted Research, Harpenden, AL5 2JQ, UK. Email: [email protected] Department of Mathematical Sciences, University of Bath, Bath, BA2 7AY, UK. Email: [email protected] Department of Mathematical Sciences, University of Bath, Bath, BA2 7AY, UK. Email: [email protected]

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of mitochondrial DNA sequences of Turcinoemacheilus genus (Nemacheilidae Cypriniformes) in Iran

Members of Nemacheilidae Family, Turcinoemacheilus genus were subjected to molecular phylogenetic analysis in this study. This genus was reported in 2009 to inhabit in Karoon River drainage, in contrary to previous assumption that it was the endemic species in the Basin of Tigris River. It was sampled from three stations placed in different tributaries in Karoon drainage and evaluated to unders...

متن کامل

PuMA: Bayesian analysis of partitioned (and unpartitioned) model adequacy

SUMMARY The accuracy of Bayesian phylogenetic inference using molecular data depends on the use of proper models of sequence evolution. Although choosing the best model available from a pool of alternatives has become standard practice in statistical phylogenetics, assessment of the chosen model's adequacy is rare. Programs for Bayesian phylogenetic inference have recently begun to implement mo...

متن کامل

Subgeneric classification of Linaria (Plantaginaceae; Antirrhineae): molecular phylogeny and morphology revisited

Linaria Mill. (Plantaginaceae) with about 160 spp. is the largest genus of the tribe Antirrhineae. We conducted phylogenetic analyses of nuclear ribosomal DNA internal transcribed spacer region (ITS) and chloroplast DNA (rpl32-trnL) sequence data to test the monophyly of currently recognized sections in Linaria. For this purpose 86 species representing seven sections of Linaria and one species ...

متن کامل

Molecular Identification of the Persian Gulf Sea Hare (Aplysia sp.) Based on 16s rRNA Gene Sequence

Background: Sea hares of the Aplysia genus are among the mollusks of interest for various researchers to study their phylogeny, bioactive compounds and the nervous system. These mollusks are herbivorous and produce chemical compounds (ink) to defend themselves. The present study provided molecular identification of the Persian Gulf (Bushehr city) sea hare using 16s rRNA gene sequence. Materials...

متن کامل

Assessment of relationships between Iranian Fritillaria (Liliaceae) Species Using Chloroplast trnh-psba Sequences and Morphological Characters

The genus Fritillaria comprises of 165 taxa of medicinal, ornamental and horticultural importance. Evolutionary relationships in this genus is an interesting research area, attracting many researchers. In this study, phylogenetic relationships among 18 native to endemic species in Iran belonging to four subgenera Petilium, Theresia, Rhinopetalum and Fritillaria, are assessed using chloroplast t...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 75  شماره 

صفحات  -

تاریخ انتشار 2014